String-Oriented Databases
نویسنده
چکیده
Relational databases and Datalog view each attribute as indivisible. This view, though useful in several applications, does not provide a suitable database paradigm for use in genetic, multi-media or scientific databases. Data in these applications are unstructured; querying on sub-strings of attributevalues is often necessary. Moreover, due to imprecision and incompleteness in the data, approximate reasoning also becomes indispensable. Our aim is to view strings as database objects that can be compared, divided, subsumed, interpreted and approximated. Allowing such operations on strings enriches the semantics and increases the expressive power of database languages. In this paper we develop an extension to the relational algebra augmenting it with the concept of a string expression with a rich structure of string variables, mapping functions, interpreted string operations and approximate evaluations. We study properties of such expressions and show that many of the well-known properties of relational algebra hold in the extension. We also discuss an extension to Datalog(String) and an implementation of a proto-type system called S-log. S-log integrates pattern-matching in Datalog framework. We contend that string-oriented database systems would be useful in applications that require efficient sub-structure analysis, such as aligning DNA strings using motifs, retrieving and synthesizing iconic images based on content.
منابع مشابه
Relative coordinates oriented symbolic string for spatial relationship retrieval
Issues involving efficient data storage and retrieval have become increasingly important in the design of image database systems. The two dimensional B-string, a kind of symbolic picture representation, has been utilized for both characterizing the spatial knowledge embedded in images and for describing images with either partially or completely overlapping objects, without the need to partitio...
متن کاملString Oriented Programming: Circumventing ASLR, DEP, and Other Guards
The protection landscape is changing and exploits are getting more and more sophisticated. Exploit generation toolkits can be used to construct exploits for specific applications using well-defined algorithms. We present such an algorithm for leveraging format strings and introduce string oriented programming. String oriented programming takes format string exploits to the next level and turns ...
متن کاملFull-Text Search Engines for Databases
Current databases are able to store several Tbytes of free-text documents. The main purpose of a database from the user’s viewpoint is the efficient information retrieval. In the case of textual data, information retrieval mostly concerns the selection and the ranking of documents. The selection criteria can contain elements that apply to the content or the grammar of the language. In the tradi...
متن کاملMulti-oriented touching text character segmentation in graphical documents using dynamic programming
The touching character segmentation problem becomes complex when touching strings are multioriented. Moreover in graphical documents sometimes characters in a single-touching string have different orientations. Segmentation of such complex touching is more challenging. In this paper, we present a scheme towards the segmentation of English multi-oriented touching strings into individual characte...
متن کاملExtracting Syntactic Patterns from Databases
Many database columns contain string or numerical data that conforms to a pattern, such as phone numbers, dates, addresses, product identifiers, and employee ids. These patterns are useful in a number of data processing applications, including understanding what a specific field represents when field names are ambiguous, identifying outlier values, and finding similar fields across data sets. O...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999